Noisy Speech Based Temporal Decomposition to Improve Fundamental Frequency Estimation

نویسندگان

چکیده

This paper introduces a novel method to separate voiced frames of noisy speech signals into low-frequency or high-frequency. separation improves the accuracy fundamental frequency (F0) estimators. In this proposal, target signal is analyzed by means ensemble empirical mode decomposition. Next, pitch information extracted from first decomposition modes. feature indicates region where F0 should be located, thus separating The then applied correct candidates detection method, improving estimation accuracy. proposed and baseline approach are evaluated considering four different algorithms. Experiments conducted with CSTR TIMIT databases, six noises various signal-to-noise ratios. Gross Error (GE) Mean Absolute (MAE) metrics adopted evaluate solutions in terms errors. Results show that outperforms baseline, low/high Moreover, solution able better improve under conditions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequency

This paper proposes a robust and accurate F0 estimation method for noisy speech. This method uses two different principles: (1) an F0 estimation based on periodicity and harmonicity of instantaneous amplitude for a robust estimation in noisy environments, and (2) an F0 estimation based on stability of instantaneous frequency as an accurate estimation method. The proposed method also uses a comb...

متن کامل

Correlation Based Fundamental Frequency Extraction Method in Noisy Speech Signal

This paper proposed a correlation based method using the autocorrelation function and the YIN. The autocorrelation function and also YIN is a popular measurement in estimating fundamental frequency in time domain. The performance of these two methods, however, is effected due to the position of dominant harmonics (usually the first formant) and the presence of spurious peaks introduced in noisy...

متن کامل

Fundamental Frequency Estimation for Noisy Speech Using Entropy-Weighted Periodic and Harmonic Features

SUMMARY This paper proposes a robust method for estimating the fundamental frequency (F0) in real environments. It is assumed that the spectral structure of real environmental noise varies momentarily and its energy does not distribute evenly in the time-frequency domain. Therefore, segmenting a spec-trogram of speech mixed with environmental noise into narrow time-frequency regions will produc...

متن کامل

Pitch estimation of noisy speech signals using empirical mode decomposition

This paper presents a pitch estimation method of noisy speech signal using empirical mode decomposition (EMD). The normalized autocorrelation function (NACF) of the noisy speech signal is decomposed into a finite set of band-limited signals termed as intrinsic mode functions (IMFs) using EMD. The periodicity of one IMF is supposed to be equal to the accurate pitch period. A conventional autocor...

متن کامل

Speech fundamental frequency estimation using the alternate comb

Reliable estimation of speech fundamental frequency is crucial in the perspective of speech separation. We show that the gross errors on F0 measurement occur for particular configurations of the periodic structure to be estimated and the other periodic structure used to achieve this estimation. The error families are characterized by a set of two positive integers. The Alternate Comb method use...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE/ACM transactions on audio, speech, and language processing

سال: 2022

ISSN: ['2329-9304', '2329-9290']

DOI: https://doi.org/10.1109/taslp.2022.3190670